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Abstract 

This  project  established  the  theoretical  and  computational  foundation  for  a  new  class  of  games, 
termed  as  the  multi-interdictor  network  games  (MINGs).  Such  games  naturally  arise  in  the  context  of 
various  military  and  security  applications  in  which  multiple  interdictors  with  differing  objectives  operat¬ 
ing  on  a  common  network.  Within  the  general  class  of  MINGs,  we  proved  existence  of  Nash  equilibria 
for  specific  subclasses  of  such  games,  including  the  shortest-path  and  maximum-flow  multi-interdictor 
network  games,  and  developed  scalable  algorithms  to  compute  such  equilibria.  In  addition,  we  pro¬ 
vided  theoretical  bounds  on  the  worst-case  efficiency  loss  of  equilibria  for  the  shortest-path  games,  with 
such  loss  caused  by  the  lack  of  coordination  among  noncooperative  interdictors,  and  prosed  an  empir¬ 
ical  approach  utilizing  decentralized  algorithms  to  study  the  average-case  efficiency  loss.  Finally,  we 
developed  convergent  decentralized  algorithms  based  on  the  connection  between  the  class  of  potential 
games  (to  which  the  MINGs  belong)  and  optimization  that  will  advance  the  computation  of  large-scale 
optimization  problems. 
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1  Introduction  and  Objectives 

This  research  project  is  mainly  motivated  by  the  game-theoretic  aspect  of  multi-interdictor  games.  Inter¬ 
diction  problems,  in  which  defenders  seek  to  destroy,  neutralize,  or  delay  their  enemy’s  potential  to  launch 
effective  attacks,  naturally  arise  in  a  variety  of  military  and  homeland  security  contexts,  such  as  coordinat¬ 
ing  tactical  air  strikes,  combatting  drug  trafficking,  protecting  civil  infrastructure  against  terrorist  attacks, 
defending  against  the  smuggling  of  nuclear  material,  and  preventing  attacks  on  critical  computer  communi¬ 
cations  networks.  Such  problems  have  traditionally  been  looked  at  from  a  centralized  decision-maker’s  point 
of  view;  that  is,  it  is  typically  assumed  that  there  is  one  agent  that  determines  and  implements  an  interdiction 
strategy  in  a  system.  However,  in  reality,  interdiction  strategies  are  often  carried  out  in  a  decentralized  or 
uncoordinated  fashion,  especially  when  there  are  multiple  interdicting  agencies.  One  of  the  most  prominent 
examples  may  be  the  ongoing  fight  against  the  terrorist  group  of  ISIS  (aka  ISIL  or  Daesh).  In  such  a  fight, 
multiple  interdictors  with  different  objectives  exist,  including  the  US-led  coalition,  Russian,  Iran,  and  Saudi 
Arabia-led  Arab  countries.  While  each  interdictor  has  their  own  set  of  adversaries  to  obstruct,  their  actions 
can  affect  (either  favorably  or  adversely)  the  other  interdictors’  interests  due  to  the  fact  that  they  all  operate 
on  a  common  network|]] 

Without  any  communication  or  coordination  between  the  interdicting  agents,  one  might  expect  that 
the  cost  or  effectiveness  of  the  decentralized  interdiction  strategy  is  far  from  optimal  (from  a  centralized 
point  of  view).  Yet  there  has  been  little  work  so  far  to  study  and  address  the  loss  in  efficiency  associated 
with  decentralized  decision-making  in  network  interdiction.  As  multi-agency  or  multi-national  collabora¬ 
tive  interdiction  missions  in  network-centric  warfare  environments  have  become  increasingly  common  and 
important,  we  believe  that  it  is  critical  from  a  military  perspective  to  understand  the  cause  and  remedies  for 
such  efficiency  loss.  This  project  is  exactly  motivated  and  designed  to  fill  this  knowledge  gap.  More  specif¬ 
ically,  the  planned  and  accomplished  objectives  of  this  research  are  as  follows,  with  the  detailed  account  of 
how  such  objectives  are  accomplished  provided  in  the  next  section. 

Objective  1.  Establish  theoretical  foundations  for  the  multi-interdictor  network  interdiction  games  (MINGs), 
including  proving  the  existence  and  uniqueness  of  equilibria  (or  lack  thereof),  and  establishing  theoretical 
bounds  for  the  worst-case  efficiency  loss. 

Objective  2.  Establish  scalable  computational  frameworks  for  the  MINGs  and  use  the  computational  meth¬ 
ods  to  numerically  quantify  the  average-case  efficiency  loss  of  the  MINGs  due  to  the  lack  of  coordination. 

Objective  3.  Develop  scalable  computational  methods  that  can  compute  Nash  equilibria  under  uncertainty 
for  both  MINGs  and  for  more  general  games. 

’The  network  here  is  broadly  termed,  including,  but  not  limited  to,  physical  infrastructure  networks,  financial  networks,  and 
cyber  networks. 
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2  Accomplishments 

2.1  Multi-Interdictor  Network  Games  (MINGs) 


As  stated  in  the  previous  section,  MIGNs  appeal-  to  be  new,  to  the  best  of  our  knowledge.  One  potential 
reason  for  the  lack  of  attention  paid  to  such  games  may  be  that  such  games  often  involve  nondifferentiabil¬ 
ity,  as  each  interdictor’s  optimization  problem  usually  entails  a  max-min  type  of  objective  functions.  Games 
involving  nondifferentiable  functions  are  generally  challenging,  in  terms  of  both  theoretical  analysis  of  their 
equilibria  and  computing  an  equilibrium.  While  in  some  cases  (such  as  in  the  case  of  shortest -path  interdic¬ 
tion),  a  smooth  formulation  (through  total  uni  modularity  and  duality)  is  possible,  such  a  reformulation  will 
lead  the  resulting  network  game  to  the  class  of  generalized  Nash  equilibrium  problems  (GNEPs),  in  which 
both  the  agents’  objective  functions  as  well  as  their  feasible  action  spaces  depend  on  other  agents’  actions. 
Although  the  conceptual  framework  of  GNEPs  can  be  dated  to  1950’s,  rigorous  theoretical  and  algorithmic 
treatments  of  GNEPs  only  began  in  recent  years.  Several  techniques  have  been  proposed  to  solve  GNEPs, 
including  penalty-based  approaches,  variational-inequality-based  approaches,  Newton’s  method,  projection 
methods,  and  relaxation  approaches.  Most  of  the  above-mentioned  works  focus  on  GNEPs  with  shared 
constraints;  that  is,  a  set  of  identical  constraints  appeal-  in  each  agent’s  feasible  action  set.  Such  games  are 
much  easier  to  deal  with,  both  theoretically  and  computationally,  than  non-shared-constraint  GNEPs.  A 
typical  MING,  however,  leads  to  a  GNEP  with  non-shared  constraints,  and  hence,  is  a  challenging  subject. 
Nevertheless,  by  investigating  the  special  structures  of  MINGs  and  their  different  forms  of  equivalent  for¬ 
mations,  we  are  able  to  establish  a  series  results  regarding  their  equilibrium  existence,  and  algorithms  that 
can  compute  such  equilibria. 

2.1.1  Shortest-path  multi-interdictor  games  (SPMINGs) 

While  a  MING  is  a  general  class  of  games,  the  specific  form  of  game  played  between  an  individual  interdic- 
tor  and  its  adversaries  needs  to  be  specified.  Generally  speaking,  there  are  three  specific  types  of  MINGs:  the 
shortest-path  multi-interdictor  games,  the  maximum-flow  multi-interdictor  games,  and  the  minimum-cost- 
flow  multi-interdictor  games.  Our  research  effort  initially  focused  on  SPMINGs  due  to  their  tractability,  and 
later  was  extended  to  the  other  types  of  games. 

Definition  and  Formulation.  Formally,  we  have  a  set  T  =  {1 .....  F}  of  interdictors  or  agents,  who 
operate  on  a  network  G  =  (V,  A),  where  V  is  the  set  of  nodes  and  A  is  the  set  of  arcs.  Each  agent’s  actions 
or  decisions  correspond  to  interdicting  each  arc  of  the  network  with  varying  intensity:  the  decision  variables 
of  agent  /  £  J  are  denoted  by  xf  6  X-f  c  Ml A I ,  where  X?  is  an  abstract  set  that  constrains  agent  /’ s 
decisions.  For  any  agent  /  £  T ,  let  x~f  denote  the  collection  of  all  the  other  agents’  decision  variables;  that 
IS,  Xt  '  —  ^X  ,  .  .  .  ,  X~f  ,  X^  ,  .  .  .  ,  X  F).  The  network  obtained  after  every  agent  executes  its  decisions  or 
interdiction  strategies  is  called  the  aftermath  network.  The  strategic  interaction  between  the  agents  occurs 
due  to  the  fact  that  the  properties  of  each  arc  in  the  aftermath  network  are  affected  by  the  combined  decisions 
of  all  the  agents.  In  addition  to  the  abstract  constraint  set  X* ,  we  assume  that  each  agent  /  <G  X  faces  a 
total  interdiction  budget  of  l/ .  The  cost  of  interdicting  an  arc  is  linear  in  the  intensity  of  interdiction;  in 
particular,  agent  /’ s  cost  of  interdicting  arc  (u.  v)  by  a;/,,,  units  is  c{ivx{iv.  Without  loss  of  generality,  we 
assume  that  if  >  0  and  c{iv  >  0  for  each  arc  (u,  v)  <E  ,4  and  for  each  agent  /  e  T .  The  optimization 
problem  for  each  agent  /  G  F  is  as  follows. 

maximize  9^{x^,x  f) 

subject  to  cLxL<hf ,  (1) 

(u,v)eA 
xf  e 
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where  the  objective  function  0*  is  agent  /’ s  obstruction  function,  or  measure  of  how  much  agent  /’ s  adver¬ 
sary  has  been  obstructed.  Henceforth,  we  refer  to  the  game  in  which  each  agent  /  £  T  solves  the  above 
optimization  problem  ([[])  as  a  MING.  The  obstruction  function  Qf  can  capture  various  types  of  interdiction 
problems.  Typically  0*  is  the  (implicit)  optimal  value  function  of  the  adversary’s  network  optimization 
problem  parametrized  by  the  agents’  decisions,  which  usually  minimizes  flow  cost  or  path  length  subject 
to  flow  conservation,  arc  capacity  and  side  constraints.  For  a  specific  SPMING,  each  agent  /  £  T  has 
a  target  node  t*  £  V  that  it  wishes  to  protect  from  an  adversary  at  source  node  sf  £  V  by  maximizing 
the  length  of  the  shortest  path  between  the  two  nodes.  The  agents  achieve  this  goal  by  committing  some 
resources  (e.g.  monetary  spending)  to  increase  the  individual  arc  lengths  on  the  network:  the  decision  vari¬ 
able  xl:V  represents  the  contribution  of  agent  f  £  T  towards  lengthening  arc  (u,  v)  £  A.  The  arc  length 
duv(xf  ,x~f)  of  arc  (u,  v)  £  Ain  the  aftermath  network  depends  on  the  decisions  of  all  the  agents.  In  this 
work,  we  considered  two  types  of  interdiction.  The  first  type  of  interdiction  is  continuous-,  in  particular, 
xf  -.=  {x-F  £  :  xiv  >  0  V(w,  v)  £  A}.  The  arc  lengths  after  an  interdiction  strategy  (x1, . . . ,  xF ) 

has  been  executed  are 

dUv{xX,  ...,xF)  =  d°uv  +  Y^  V(u,  v)  £  A,  (2) 

feP 

where  xuv  captures  how  much  agent  /  extends  the  length  of  arc  (u,  v).  We  assume  that  dfv  >  0  for  all 

(u,  v )  £  A. 

The  second  type  of  interdiction  is  discrete-,  in  this  case,  Xf  •=  {xf  £  RIaI  :  Xuv  £  {0,1}  V(w,  v)  £ 
A)  and  the  arc  lengths  in  the  aftermath  network  are 

duv{x1, . . .  ,xF)  =  d°uv  +  e„<,maxx{„  V{u,v)  £  A,  (3) 

J  c.T 

where  euv  £  M>o  is  the  fixed  extension  of  arc  (u,  v ).  In  other  words,  the  length  of  an  arc  is  extended  by  a 
fixed  amount  if  at  least  one  agent  decides  to  interdict  it. 

Let  pf  =  {p{ ,p^, , ptf  }  be  the  set  of  s?  —  tf  paths  available  to  agent  f  £  T .  The  length  of  a  path 
p  £  pf  is  given  by 

dp(x1,...,xF)  =  ^2  duv(x1,...,xF),  (4) 

(u,v)&p 

where  duv{xx , . . . ,  xF)  is  as  defined  in  equation  ([2])  for  continuous  interdiction,  and  as  defined  in  ([3])  for  the 
discrete  case.  The  optimization  problem  for  each  interdicting  agent  f  £  T  is  then: 


maximize 

III 

1 

min  dv(xf,x  f) 

xf 

P&Pf 

subject  to 

V  rf  rf 

/  J  uv  uv 

-O 

VI 

(5) 

(u,v)eA 

xf  £  Xf. 

Summary  of  Equilibrium  Properties  for  SPMINGs.  We  focused  on  studying  pure-strategy  Nash  equi¬ 
libria  in  this  work,  as  opposed  to  mixed-strategy  Nash  equilibria  (in  which  agents’  strategies  are  random¬ 
ized).  More  specifically,  we  attempted  to  answer  two  questions:  (1)  if  a  pure  strategy  Nash  equilibrium 
exists;  (2)  if  the  answer  to  (1)  is  yes,  is  the  equilibrium  unique. 

Equilibrium  Existence.  For  the  equilibria  of  SPMINGs  under  continuous  interdiction,  we  have  the  following 
result  (in  Publication  [PI]  listed  in  Section |4]of  this  report). 

Proposition  1.  Given  that  each  agent  f  £  T  solves  the  problem  with  dp{xf,x  f)  defined  as  in  0 
and  ([2]),  and  assume  that  the  abstract  set  X j  in  ©  is  nonempty,  convex  and  compact  for  each  f  £  T,  the 
SPMING  under  continuous  interdiction  has  a  pure  strategy  Nash  equilibrium. 
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Under  discrete  interdiction,  the  existence  of  a  pure-strategy  equilibrium  is  not  always  guaranteed  when 
different  interdictors  are  competing  against  different  adversaries.  We  have  discovered  an  example  to  illus¬ 
trate  this  situation  in  [PI],  If,  however,  the  interdictors’  adversaries  all  share  the  same  source  and  target  of  a 
network  (the  actual  topology  of  the  networks  can  be  different),  then  the  following  result  is  shown. 

Proposition  2.  Suppose  the  source  and  target  for  each  agent  in  a  SPMING  are  the  same.  Let  x*  denote 
the  optimal  solution  of  the  centralized  problem.  Then  x*  is  a  pure-strategy  to  the  SPMING  under  discrete 
interdiction. 

Equilibrium  Uniqueness  We  have  found  simple  instances  of  the  SPMING  that  multiple  equilibria  exist. 
Hence,  one  may  not  expect  to  establish  theoretical  results  regarding  the  uniqueness  of  equilibria  for  any 
MINGs.  Through  the  instances  of  the  SPMING  with  multiple  equilibria,  we  observed  that  all  agents  may 
be  worse  off  in  a  certain  equilibrium  than  in  other  equilibria.  This,  together  with  the  lack  of  uniqueness  of 
equilibria,  motivated  us  to  design  decentralized  algorithms  to  compute  equilibria  of  SPMINGs  (or  of  MINGs 
in  general)  such  that  the  entire  path  of  how  a  particular  equilibrium  is  reached  can  be  known.  In  addition, 
such  algorithms  shall  be  able  to  find  multiple  equilibrium  (for  example,  by  changing  the  starting  point  of 
running  the  algorithms).  The  development  of  such  algorithms  is  reported  in  the  following. 

Computation  of  Equilibria.  In  developing  algorithms  to  compute  equilibria  of  SPMINGs,  we  initially  fo¬ 
cused  on  SPMINGs  with  continuous  interdiction.  Within  such,  we  first  investigated  the  feasibility  of  using 
a  centralized  approach  to  solve  the  SPMING,  which  was  to  serve  as  a  benchmark  (both  in  terms  of  perfor¬ 
mance  and  in  terms  of  solution  accuracy)  for  the  later  developed  decentralized  algorithms.  For  SPMINGs 
with  discrete  interdiction,  no  known  centralized  algorithms  exist. 

A  Centralized  Algorithm.  The  term  “centralized”  here  refers  to  the  computational  methods  that  are  designed 
to  directly  compute  an  equilibrium  of  a  game  (much  like  of  finding  an  optimal  solution  of  an  optimization 
problem),  while  ignoring  how  such  an  equilibrium  is  achieved  by  the  strategic  agents.  In  contrast,  the  term 
“decentralized”  refers  to  algorithms  that  specify  how  each  agent  reacts  to  the  observed  new  information  of 
their  rivals’  past  action^}  and  an  equilibrium  may  be  reached  by  such  algorithms  through  iterative  interac¬ 
tions  among  the  agents;  that  is,  such  algorithms  try  to  mimic  the  iterative  decision-making  process  among 
the  agents. 

For  games  with  differentiability  (and  convexity),  the  prevailing  approach  for  centralized  algorithms  to 
compute  an  equilibrium  is  to  write  down  the  first-order  optimality  condition  (aka  the  KKT  condition)  of  each 
agent’s  optimization  problem,  and  then  to  stack  all  the  KKT  conditions  together  to  form  a  complementarity 
problem  (CP).  Newton-based  algorithms  have  been  developed  in  the  past  several  decades  to  compute  a 
solution  of  of  a  CP  (that  corresponds  to  the  equilibrium  of  the  game  with  differentiability  and  convexity). 
While  the  SPMING  formulation  given  in  (J5])  is  not  differentiable,  by  writing  out  the  dual  of  the  adversary’s 
shortest-path  problem,  the  SPMING  can  be  reformulated  into  a  game  with  smooth  functions,  with  each 
agent  solving  the  following  optimization  problem. 

f  f 

maximize  yJf  —  yJ  f 

xf ,  yf 

subject  to  y[  —  y[  <  duv(xf ,  x~f)  V(it,  v)  G  A, 

cLxL  <  (6) 

(u,v)eA 

xf  G  Xf, 

y{,  >  0  Vv  €  V, 

2Such  algorithms  also  need  to  specify  exactly  what  information  can  be  observed  by  each  agent  at  the  time  they  need  to  make  a 
decision. 
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The  formulation  ([6])  gives  us  some  insight  into  the  structure  of  strategic  interactions  among  agents  in  a  SP- 
MING.  Note  that  in  formulation  d6]),  the  objective  function  for  each  agent  /  G  T  only  depends  on  variables 
indexed  by  /  (in  particular,  yJf  and  yJf).  However,  the  constraint  set  for  each  agent  /  is  parametrized  by 
other  agents’  variables  x~f,  which  leads  to  a  generalized  Nash  equilibrium  problem  (GNEP). 

By  writing  out  the  optimality  conditions  of  ([6])  and  stacking  such  conditions  of  all  the  agents,  a  linear 
complementarity  problem  of  the  general  form  0  <  w  _L  q  +  Mw  >  0,  denoted  as  LCP(q,  M),  can  be 
obtained,  where  the  ‘_L’  sign  represents  the  vector  product,  q  €  Wl  and  matrix  M  e  Wixi1  are  from  input 
data,  and  w  €  M.d  is  the  vector  of  variables  to  be  found.  While  pivoting-based  algorithms,  generally  known 
as  the  Lemke’s  method,  are  well-established  to  solve  LCPs,  it  is  known  that  the  method  may  terminate  on 
a  secondary  ray  (similar  to  the  well-known  simplex  method  for  linear  programs  terminating  on  an  extreme 
direction).  However,  we  showed  in  [PI]  that  barring  from  degeneracy,  Lemke’s  method  will  always  find  a 
solution  of  the  LCP  derived  from  a  SPMING. 

Decentralized  Algorithms.  In  designing  decentralized  algorithms  to  solve  the  SPMINGs,  we  utilized  the 
best-response-type  algorithms,  also  known  as  Gauss-Seidel  algorithms,  in  which  each  agent  solves  their  own 
optimization  problem  based  on  the  information  of  other  agents’  actions  so  far  (assuming  such  information 
is  available),  and  then  passes  his/her  optimal  solution  to  the  next  agent  for  him/her  to  continue  the  iterative 
algorithm.  While  the  algorithm  idea  is  simple,  it  is  well  known  that  such  “naive”  implementation  of  a  Gauss- 
Seidel  (GS)  algorithm  may  fail  to  converge  to  equilibria.  However,  we  were  able  to  show  convergence  of 
the  unmodified  GS  algorithm  for  SPMINGs  under  discrete  interdiction  in  [PI],  as  summarized  below. 

Proposition  3.  Suppose  that  the  Gauss-Seidel  algorithm  is  applied  to  a  SPMING  with  discrete  interdiction, 
and  the  termination  criterion  \\xk  —  Xk-  1 1  <  e  is  used  with  e  <  1.  If  the  algorithm  terminates  at  Xk>  then 
Xk  is  an  equilibrium  to  the  SPMING. 

Corollary  1.  Consider  a  SPMING  with  discrete  interdiction  with  common  source-target  pairs,  and  assume 
that  the  initial  arc  lengths  d  and  arc  extensions  e  are  integral.  Suppose  that  the  Gauss-Seidel  algorithm  is 
applied  to  such  a  problem,  and  the  termination  criteria  || Xk  ~  Xk- 1 1  <  e  is  used  with  e  <  1.  Then  the 
algorithm  will  terminate  finitely  at  an  equilibrium. 

For  SPMINGs  under  continuous  interdiction,  no  such  convergent  results  can  be  achieved  for  the  unmod¬ 
ified  GS  algorithm.  Instead,  at  each  iteration,  we  let  an  agent  solves  the  so-called  regularized  version  of 
his/her  optimization  problem;  that  is, 

maximize  9f(Xf  ,X  ~ f )  —  r 

x1 

subject  to  xf  F  Ef(x~f), 

where  r  is  a  positive  constant.  Here  the  regularization  term  is  evaluated  in  relation  to  a  candidate  point  x- 
Note  that  the  point  xJ  and  the  other  agents’  decision  variables  are  fixed  when  the  problem  <[7j)  is  solved. 
We  refer  to  the  overall  modified  Guass-Seidel  algorithm  as  the  regularized  Gauss-Seidel  (RGS)  procedure. 
Though  we  were  not  able  to  prove  unconditional  convergence  of  RGS  algorithm  on  the  SPMINGs,  due  to 
the  fact  that  SPMINGs  are  GNEPs  with  non-shared  constraints,  we  nevertheless  had  the  following  results 
regarding  convergence,  as  documented  in  [PI]. 

Proposition  4.  Let  {xk }  be  the  sequence  generated  by  applying  the  RGS  algorithm  to  a  SPMING  with 
continuous  interdiction,  wherein  each  agent  solves  0.  Suppose  that  {xk}  converges  to  X-  Then  x  A  on 
equilibrium  to  the  SPMING. 

Corollary  2.  Consider  applying  the  RGS  algorithm  to  a  SPMING  with  continuous  interdiction,  with  com¬ 
mon  source-target  pairs.  Let  {xk }  be  the  sequence  generated  by  the  algorithm.  If  x  A  a  cluster  point  of  this 
sequence,  then  it  is  also  an  equilibrium  of  the  SPMING. 


XJ 


Xs 
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Numerical  Results.  We  applied  the  algorithms  discussed  above  to  study  several  instances  of  SPMINGs. 
The  decentralized  algorithms  were  implemented  in  MATLAB  R2010a  with  CPLEX  vl2.2  as  the  optimiza¬ 
tion  solver.  The  LCP  formulation  for  a  SPMING  with  continuous  interdiction  was  solved  using  the  MAT- 
LAB  interface  for  the  complementarity  solver  PATH.  Computational  experiments  were  carried  out  on  a 
desktop  workstation  with  a  quad-core  Intel  Core  i7  processor  and  16  GHz  of  memory  running  Windows  7. 

Among  the  tested  instances,  the  larger  network  cases  are  illustrated  by  Figure  [T]  and  the  corresponding 
numerical  results  are  provided  in  Table  [T] 


Figure  1:  Network  structure  for  the  larger  SPMING  instances. 


Table  1:  Number  of  iterations  and  running  times  for  the  network  in  Figure  [T| 


Continuous  Interdiction 

Discrete  Interdiction 

Decentralized 

LCP 

Decentralized 

#  Agents 

#  Iters  Runtime  (s) 

Runtime  (s) 

#  Iters 

Runtime  (s) 

5 

3 

0.0205 

0.0290 

5 

0.1776 

10 

5 

0.0290 

0.1833 

3 

0.1627 

15 

11 

0.1103 

0.7534 

3 

0.2419 

20 

5 

0.0723 

2.1106 

3 

0.3164 

25 

13 

0.2609 

4.8167 

3 

0.4005 

30 

15 

0.4070 

10.2256 

3 

0.5155 

35 

10 

0.3605 

17.7387 

3 

0.5948 

40 

41 

1.7485 

30.2382 

3 

0.7387 

45 

12 

0.6601 

48.6280 

3 

0.8794 

50 

12 

0.7981 

75.0420 

3 

1.0385 

Two  key  observations  from  our  numerical  experiments  are:  (1)  multiple  Nash  equilibria  were  indeed 
found  by  varying  the  starting  point  of  the  decentralized  algorithms;  (2)  the  RGS  decentralized  algorithm 
for  SPMINGs  with  continuous  interdiction  scales  very  well,  compared  to  the  centralized  LCP  algorithm. 
It  can  be  seen  from  Table  [T]  that  with  the  size  of  input  data  increasing,  the  run  time  of  the  LCP  algorithm 
(aka  the  Lemke’s  method)  increases  significantly  (which  is  not  a  surprise  as  the  Lemke’s  method  is  not  a 
polynomial-time  algorithm,  just  like  the  simplex  method);  while  the  run  time  for  the  decentralized  algorithm 
remains  relatively  flat. 


Efficiency  Loss  Quantification.  It  is  well  known  that  a  Nash  equilibrium  outcome  may  not  be  optimal 
from  a  system  perspective  (that  is,  not  Pareto  optimal)  due  to  the  unilateral  actions  among  the  strategic 
agents.  This  sub-optimality  of  Nash  equilibria,  broadly  termed  as  efficiency  loss,  has  been  studied  in  many 
other  game-theoretic  settings.  Most  of  such  efforts  have  been  focused  on  routing  games,  congestion  games, 
and  network  formation  games;  while  such  studies  on  facility  location  games,  scheduling  games,  and  resource 
allocation  games  also  appear  in  the  literature.  Almost  all  of  the  work  described  above  study  the  worst-case 
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inefficiency  of  a  given  equilibrium  concept.  Although  a  few  researchers  have  studied  the  average  inefficiency 
of  equilibria,  either  theoretically  or  empirically,  and  have  used  it  as  a  basis  to  design  interventions  to  reduce 
the  inefficiency  of  equilibria,  research  in  this  direction  has  not  received  much  attention.  In  this  project, 
we  focused  on  establishing  a  numerical  approach  to  empirically  study  the  average-case  efficiency  loss,  and 
comparing  it  with  the  worst-case  loss.  The  main  purpose  is  to  illustrate  that  for  certain  games,  even  the 
theoretical  worst-case  efficiency  loss  is  significant,  the  measure  may  be  too  conservative  as  the  worst  case 
may  be  caused  by  pathological  instances.  In  such  cases,  if  the  average-case  efficiency  loss  is  insignificant, 
then  no  additional  actions  may  be  needed  to  improve  efficiency,  which  would  save  time  and  money  of  any 
central  agencies.  Motivated  by  this  purpose,  we  introduced  the  definition  of  average  case  efficiency  as 
follows. 

First  consider  a  central  planner  with  a  comprehensive  view  of  the  network  and  all  the  agents’  objectives 
that  can  pool  the  agents’  interdiction  resources  and  determine  an  interdiction  strategy  that  maximizes  some 
global  measure  of  how  much  the  agents’  adversaries  have  been  obstructed.  Let  0c(x1, . . . ,  xF)  represent 
the  global  obstruction  function  for  a  given  interdiction  strategy  (x1, . . . ,  xF).  The  central  planner’s  problem 
corresponding  to  a  generic  multi-interdictor  problem  is  then: 

maximize  6c(x 1, . . . ,  xF) 

x1,...,xF 

subject  to  E  E  ^UV^UV  *E  bf,  (8) 

feT  (u,v)£A  feTF 

xf  e  Xf  V/  E  T. 

([8])  is  referred  to  as  the  centralized  problem,  and  we  focused  primarily  on  when  the  global  obstruction 
function  is  utilitarian ;  that  is, 

0c(xl, . . . ,  xF)  :=  ^  x-^). 

feT 

A  commonly  used  measure  of  Nash  equilibria’s  inefficiency  is  the  price  of  anarchy.  Formally  speaking,  let 
Mi  be  the  set  of  all  equilibria  corresponding  to  a  specific  instance  I.  (In  the  context  of  MINGs,  an  instance 
consists  of  the  network,  obstruction  functions,  interdiction  budgets,  and  costs).  For  the  same  instance  /,  let 
(.i1  , . . . ,  xF  )  denote  a  globally  optimal  solution  to  the  centralized  problem  <J8]).  Then  the  price  of  anarchy 
of  the  instance  /  is  defined  as 


p{I)  := 


9c(x1* , . . .  xF*) 
max  - — — ; - p- . 


(9) 


Let  X  be  the  set  of  all  instances  of  a  game0  If  the  worst  equilibrium  has  a  zero  objective  value  while  the 
global  optimal  value  of  the  centralized  problem  is  nonzero,  p  is  set  to  be  infinity.  In  addition  to  the  price  of 
anarchy  for  an  instance  of  a  game,  we  also  define  the  worst-case  price  of  anarchy  over  all  instances  of  the 
game  (denoted  as  w.p.o.a)  as 

w.p.o.a  :=  sup  p{I).  (10) 

rei 

In  addition  to  being  too  conservative,  the  measure  w.p.o.a  has  another  drawback  that  explicit  theoretical 
bounds  on  the  worst-case  price  of  anarchy  may  be  difficult  to  obtain  for  general  classes  of  games.  In  fact 
most  of  the  related  research  has  focused  on  identifying  classes  of  games  where  such  bounds  may  be  derived. 
In  this  work,  we  showed  how  our  proposed  decentralized  algorithms  can  be  used  to  empirically  study  the 

3It  is  assumed  implicitly  that  for  all  I  G  X,  the  set  A/j  is  nonempty  and  a  global  optimal  solution  to  the  centralized  problem 
exists.  By  convention,  p  is  set  to  1  if  the  worst  equilibrium  as  well  as  the  global  optimal  solution  to  the  centralized  problem  both 
have  zero  objective  value. 
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average-case  efficiency  loss  (denoted  by  a.e.l).  Let  X'  denote  a  finite  set  such  that  X'  C  X,  and  let  \I'\  denote 
the  cardinality  of  the  the  set  X' .  Then 

a.e.l{l')  :=  p{I).  (11) 

1  1  lev 

In  other  words,  the  average-case  efficiency  loss  is  the  average  value  of  p(I)  as  defined  in  ([2])  over  a  set  of 
sampled  instances  X’  C  X  of  a  game. 

To  numerically  study  the  average-case  efficiency  loss  of  a  SPMING,  we  first  established  a  theoretical 
lower  bound  on  the  worst-case  price  of  anarchy  of  SPMINGs,  which  is  (F  +  l)/(2  +  e),  with  F  being  the 
number  of  interdicting  agents. 

Using  the  RGS  decentralized  algorithm  to  find  multiple  equilibria  and  sampling  from  a  range  of  input 
data  based  on  the  network  structure  in  Figure  [T]  we  were  able  to  compute  the  average-case  efficiency  loss. 
The  comparison  between  such  loss  and  the  lower-bound  of  the  worst-case  price  of  anarchy  is  provided  in 
Figure  [2]  It  can  be  observed  that  for  the  particular  graph  structure  under  consideration,  we  observe  that  the 
average  efficiency  loss  grows  at  a  much  lower  rate  than  the  worst-case  efficiency  loss. 


2.1.2  Maximum-flow  multi-interdictor  games  (MFMINGs) 

Built  upon  our  experiences  with  the  SPMINGs,  we  extended  our  work  to  study  another  type  of  MING: 
maximum-flow  multi-interdictor  network  games  (MFMINGs).  Such  game  models  can  be  used  to  describe 
situations  in  which  adversaries  wish  to  maximize  the  flow  of  some  goods  between  two  nodes  on  a  network. 
Each  interdictor’s  aim  is  to  minimize  this  maximum  flow  by  means  of  taking  actions  to  reduce  arc  capacities 
on  the  network. 

Formally,  let  the  network  on  which  the  various  agents  act  be  described  by  the  digraph  G  =  ( V.  A).  Each 
arc  (i,j)  E  A  has  a  capacity  Uij.  For  agent  /  E  F,  the  adversary  wishes  to  maximize  the  flow  between 
source  node  s f  and  target  node  iJ  .  Agent  /  then  wishes  to  minimize  the  maximum  flow  s?  —  t/  flow.  Each 
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agent  /  then  solves  the  following  optimization  problem: 


(  max 

y 

yLf 

\ 

minimize 

xf 

e*(xf 

x  f)  = 

s.t. 

Y  yf 

/  y  &ILV 

~'}2yL  =  0^u^v 

vev 

v&V 

V 

yl  <  ua 

(xf,x~f),  Va  e  A\{(tf,sf)}  / 

(12) 

subject  to 

£ 

cf  xf  <  bf 

^'UV'XjUV  —  w 

(u,v)GA 


xf  G  Xf. 


The  x  variables  represent  the  actions  of  the  interdicting  agent,  with  each  agent  /’ s  interdiction  action  subject 
to  a  budget  constraint,  along  with  other  possible  constraints  summarized  in  the  abstract  set  X?.  The  y 
variables  represent  the  decisions  of  the  adversaries.  That  is,  yuv  is  the  flow  on  arc  (u,  v)  on  the  aftermath 
network. 

As  in  the  SPMING,  the  effect  of  interdiction  decisions  (x3 ,  x^3)  on  the  arc  capacities  depend  on  the 
type  of  interdiction.  If  interdiction  is  assumed  to  be  continuous  and  additive,  then  the  relationship  is  given 
as  follows: 

F 


■u 


ij 


[x3  xx  3)  =  max 


(«ij  -  Xij  )  ’  0 


/=! 


where  x f  is  restricted  to  be  component-wise  non-negative.  If  on  the  other  hand  interdiction  is  binary,  the 
arc  capacities  are  given  by 

Uij(xf,x~f)  =  v9A  1-  max  xf-). 

3  /e{t,...,F}  3 

While  under  either  the  continuous  or  discrete  interdiction,  the  objective  function  in  (fT2|)  is  not  smooth. 
Utilizing  LP  duality  and  by  introducing  an  auxiliary  variable  (r),  we  obtained  a  smooth  reformulation  for 
the  MFMING  under  continuous  interdiction.  (For  discrete  interdiction,  other  than  the  integer  variables, 
everything  else  is  smooth  in  the  reformulation.) 


minimize 

xf ,  yf ,  af ,  t 

£ 

(hi)eA 

F 

subject  to 

r><-X14 

h= 1 

T  >  0 

Pfj  +  af  —  a f  >0  V(z,  j)  G  A 

alf  ~  aif  ^  1 

Pij>  0,  V(f,j)  G  A 

Y  cf  xf  <  bf 

(u,v)eA 

Xf  G  Xf. 

(13) 


When  each  interdicting  agent  solves  <[T3|),  the  overall  MFMING  leads  to  a  GNEP,  as  other  agents’  decision 
variables  enter  into  the  first  constraint:  r  >  w.T  —  Ylh=\  xij-  The  main  difficulty  of  such  a  game  lies  in 
the  fact  that  the  objective  function  in  (fT3|)  involves  a  bilinear  term,  making  the  MFMING  an  instance  of 
nonconvex  games. 
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Nonconvex  games  (with  differentiability)  in  general  are  very  difficult  to  analyze  in  terms  of  equilibrium 
properties  due  to  the  fact  that  the  first-order  optimality  conditions  (FOCs)  are  only  necessary  conditions  for 
each  agent’s  optimization  problem.  In  addition,  the  complementarity  problems  derived  from  stacking  all 
agents’  FOCs  usually  lack  of  desired  properties  (mainly  monotonicity  of  set-valued  mappings)  to  deduce 
any  results.  Nonconvex  games  with  integer  variables  |^]  are  even  harder  to  analyze,  with  few  known  results 
in  the  literature.  Nevertheless,  we  were  able  to  show  equilibria  existence  for  a  special  class  of  MFMINGs. 
(See  [Dl]  for  details.) 

Proposition  5.  For  a  MFMING  under  either  continuous  or  discrete  interdiction,  an  equilibrium  always 
exists  if  the  source  and  target  nodes  are  the  same  for  each  agent,  i.e.  s*  =  s  and  F  =  tfor  f  =  1 .....  F. 

Other  theoretical  properties  of  MFMING  equilibria,  as  well  as  computational  approaches,  are  the  sub¬ 
jects  of  the  ongoing  research  of  Pis  and  their  collaborator,  Professor  Jong-Shi  Pang  at  University  of  South¬ 
ern  California.  We  are  working  on  to  develop  more  general  theoretical  and  computational  frameworks  for 
broader  classes  of  nonconvex  games.  Should  it  be  successful,  it  would  represent  a  major  step  forward  in  the 
field  of  computational  game  theory. 

2.2  Cost  Sharing  and  Network  Fortification 

As  a  side  project  inspired  by  the  primary  research  activities  funded  by  this  grant,  we  studied  stochastic 
linear  programming  games:  a  class  of  stochastic  cooperative  games  whose  payoffs  under  any  realization  of 
uncertainty  are  determined  by  a  specially  structured  linear  program.  These  games  can  model  a  variety  of 
settings,  including  a  scenario  in  which  players  can  cooperate  to  fortify  a  network  against  an  impending  attack 
or  natural  disaster  with  uncertain  effects  on  the  network.  We  focused  on  the  core  of  these  games  under  an 
allocation  scheme  that  determines  how  payoffs  are  distributed  before  the  uncertainty  is  realized,  and  allows 
for  arbitrarily  different  distributions  for  each  realization  of  the  uncertainty.  Assuming  that  each  player’s 
preferences  over  random  payoffs  are  represented  by  a  concave  monetary  utility  functional,  we  proved  that 
these  games  have  a  nonempty  core.  Furthermore,  by  establishing  a  connection  between  stochastic  linear 
programming  games,  linear  programming  games  and  linear  semi-infinite  programming  games,  we  showed 
that  an  allocation  in  the  core  can  be  computed  efficiently  under  some  circumstances.  The  major  results  of 
this  line  of  research  are  documented  in  [P2] . 

2.3  Bridging  Game  Theory  and  Large-Scale  Optimization  under  Uncertainty 

2.3.1  Potential  games  and  their  relationship  to  optimization 

In  the  above-described  work,  all  input  data  were  assumed  to  be  deterministic.  In  reality,  however,  uncer¬ 
tainties  are  present  in  various  aspects  of  a  decision-making  process,  ranging  from  uncertain  input  data  to 
incomplete  information  of  rivals’  actions,  in  a  game  setting.  To  extend  our  work,  we  investigated  into  games 
with  exogenously  uncertain  input  data  (namely,  the  uncertainty  of  the  input  data  are  uncontrollable  by  the 
agents)0 

For  a  generic  games  under  exogenous  uncertainty,  each  agent  /  (/  =  1,2 , ,F)  tries  to  solve  the 
following  optimization  problem 

minimize  4>f(x)  =  E  [6f(xf,  x_/;  £)] 

xf  (14) 

subject  to  xf  G  Xf. 

4Here  the  nonconvexity  refers  to  the  fact  that  even  not  considering  the  integrality,  the  remaining  game  is  still  nonconvex. 

3  A  related  but  different  concepts  of  games  under  uncertainty  is  games  of  incomplete  information.  In  such  games,  the  uncertainty 
is  mainly  due  to  the  incomplete  knowledge  of  the  agents  towards  their  rivals  in  terms  of  the  rivals’  utility  functions  and/or  feasible 
action  spaces.  Such  incomplete  information  may  or  may  not  be  caused  by  exogenous  uncertainties  uncontrollable  by  the  agents.  In 
such  games,  the  pertinent  equilibrium  concept  is  Bayesian  Nash  equilibrium  (BNE).  Developing  algorithms  to  compute  BNE  is  the 
ongoing  research  effort  of  PI  Liu. 
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By  using  the  expected  value  as  the  objective  function,  we  assumed  that  the  agents  are  risk-neutral.  We 
denote  the  game  above  as  NEP  ((/>  f ,  X A  set  of  player  decision  variables  x*  =  (xj, . . . ,  x*F)T  G  X  := 
A'  j  is  a  Nash  equilibrium  to  NEP  if  and  only  if  the  following  condition  holds: 

M*f>x*-f)  <  vyf  e  xf-  (15) 

Since  the  above  definition  is  not  different  than  that  to  a  deterministic  game,  all  the  theories  regarding 
equilibrium  existence  and  uniqueness  are  applicable  to  the  game  NEP(< cj)f,Xf)J=1  as  well.  On  the  com¬ 
putation  side,  algorithms  for  computing  an  equilibrium  of  NEP  ((/)/,  Xf)J=1  have  been  well  studied.  Most 
of  such  algorithms  rely  on  converting  the  game  into  a  (finite-dimensional)  variational  inequality  or  a  com¬ 
plementarity  problem  through  stacking  the  first-order  optimality  conditions  of  (fT4|).  Such  algorithms  are 
termed  as  the  centralized  algorithms.  In  this  research  effort,  we  were  interested  in  designing  decentralized 
algorithms  to  compute  an  equilibrium,  which  can  achieve  two  things  that  centralized  algorithms  cannot 
achieve:  (1)  the  entire  path  of  how  an  equilibrium  is  achieved  can  be  obtained  through  decentralized  algo¬ 
rithms,  which  will  provide  more  insights  on  how  agents  interact  with  each  other  in  a  game,  and  on  why 
certain  equilibria  are  preferred  than  the  others;  (2)  multiple  equilibria  may  be  found  through  decentralized 
algorithms]^]  As  a  result,  the  average-case  efficiency  loss  of  noncooperated  equilibria  can  then  be  studied 
using  the  computational-based  approach  established  in  this  research,  as  described  above. 

While  the  conceptual  ideas  of  decentralized  algorithms  (broadly  termed  that  include  any  type  of  learning- 
based  algorithms,  as  well  as  the  best-response-type  algorithms)  for  games  are  generally  easy,  and  implemen¬ 
tation  is  usually  straightforward,  the  convergence  (to  an  equilibrium),  however,  is  very  difficult  to  establish 
in  general.  As  a  result,  we  focused  on  a  specific  class  of  games,  known  as  the  exact  potential  games. 

Definition  1.  V\The  NEP(cf>f,  X f)^=1  is  said  to  have  an  exact  potential  function  P  :  Mn  — >  Wif  the  following 
holds  for  f  =  1,  . . .  ,F: 

<t>f(xf,x-f)-<f>f(yf,x-f)  =  P(xf,x-f)  -  P(yf,x-f) 

Vx/,  yf  E  Xf,  and  x_/  E  Xf. 

f'er 

f'H 

A  game  that  has  an  exact  potential  function  is  called  an  exact  potential  game. 

A  key  feature  of  potential  games  is  that  under  certain  conditions,  many  decentralized  algorithm  have 
been  show  to  be  convergent  to  an  equilibrium.  Such  a  result,  however,  has  not  been  established  (to  the  best 
of  our  knowledge)  for  potential  games  under  uncertainty.  One  of  the  major  contributions  of  this  line  of 
research  is  to  till  in  such  a  void. 

While  the  class  of  exact  potential  games  may  be  restrictive,  it  nonetheless  includes  several  very  important 
games,  such  as  the  routing  games  and  Nash-Cournot  games.  The  the  SPMINGs  and  MFMINGs  described 
above  with  their  dual  formulations  «|6])  and  (fT3|),  respectively)  are  both  instances  of  exact  potential  games  as 
well.  This  was  exactly  why  we  chose  to  focus  on  potential  games  in  this  research  effort.  In  addition,  based 
on  the  relationship  between  a  potential  game  and  a  single  optimization  problem,  the  decentralized  algorithms 
we  developed  are  also  applicable  to  solve  large-scale  optimization  problems  with  block-structure.  Such  a 
relationship  is  formally  stated  in  [P3]  and  is  presented  below. 

Proposition  6.  Suppose  that  NEP(9  f ,  X f)J=1  is  a  player-wise  convex  game  with  an  exact  potential  function 
P,  and  each  Off,  X—f,  f)  is  continuously  differentiable  for  any  x_/  E  X-f  and  for  any  (  £  B.  Then  x* 

6There  are  centralized  algorithms  that  are  designed  to  compute  all  equilibria  of  a  game,  such  as  the  works  done  by  Professor  Karl 
Shcmedders  in  Northwestern  University  and  his  collaborators.  However,  the  class  of  the  games  that  such  algorithms  are  applicable 
is  very  restricted,  and  is  very  different  than  our  interests,  which  stemmed  from  the  multi-interdictor  network  games. 

7The  defition  is  due  to  Dov  Monderer  and  Lloyd  S  Shapley.  “Potential  games.”  Games  and  Economic  behavior,  14(1):  124—143, 
1996. 
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is  an  equilibrium  to  the  NEP(0j-.  Xj)j=i  if  and  only  if  x*  E  X  is  a  stationary  solution  of  the  following 
problem: 

minimize  P(x) 

X 

F 

subject  to  x  E  X  =  Xj. 

f= i 


(16) 


Based  on  the  above  proposition,  it  can  be  seen  that  finding  a  stationary  point  of  the  following  optimiza¬ 
tion  problem  is  equivalent  to  a  game: 


minimize  E  [P(x\,  X2,  ■  ■  ■ ,  xf]  £)] 

X 

subject  to  x  =  (xi,  X2,  ■  ■  ■  ,xf)  G  X  =  X\  x  X2  x  ■  •  •  x  Xp, 


(17) 


where  Xf  E  for  each  /  =  1, . . . ,  F;  namely,  the  feasible  region  X  in  fl7j)  is  the  Cartesian  product  of  the 
sets  Xf  s.  This  problem  can  be  viewed  as  a  game  in  which  each  agent  controls  their  own  decision  variable 
xj,  while  the  objective  function  of  each  agent  is  the  same,  which  is  the  objective  function  in  (fF7j).  This  is 
easily  seen  to  be  a  exact  potential  game  since  the  common  objective  function  is  the  exact  potential  function. 


2.3.2  Decentralized  algorithms 

In  this  work,  we  also  focused  on  best-response  (BR)-type  decentralized  algorithms  (as  opposed  to  other 
learning  based  algorithms,  such  as  fictitious  play,  regret  matching,  etc).  In  addition  to  the  Gauss-Seidel- 
type  implementation  of  the  BR  algorithm,  there  is  another  way  of  implementation,  generally  referred  to 
as  Gauss-Jacobi.  Its  essence  is  to  have  all  the  agents  update  their  best  responses  all  at  once,  based  on 
the  previous  iterations’  information  (as  opposed  to  in  Gauss-Seidel  approach  where  the  agents  take  turns  to 
update  their  best-response).  The  computational  advantage  of  Gauss-Jacobi’s  approach  is  apparent  as  it  would 
allow  for  natural  parallel  computing.  The  disadvantage,  however,  is  that  Gauss-Jacobi-type  algorithms  can 
rarely  be  shown  to  be  convergent  to  an  equilibrium  (or  to  an  optimal  solution)  (making  them  only  heuristic 
algorithms),  even  if  Gauss-Seidel-type  algorithms  can  be  shown  to  be  convergent  when  applied  to  the  same 
problems.  In  this  work,  however,  we  indeed  can  rigorously  prove  the  convergence  to  an  equilibrium  of  both 
the  Gauss-Seidel  and  Gauss-Jacobi  type  algorithms,  given  that  we  add  a  regularization  term  to  the  objective 
function. 

Having  the  convergence  of  the  BR-type  decentralized  algorithms  is  not  enough,  however,  in  our  case,  as 
we  still  have  uncertainties  to  deal  with.  A  common  approach  (as  in  the  stochastic  programming  community) 
is  to  sample  from  the  uncertain  quantities,  and  then  solve  the  sampled  problem  as  an  approximation.  Such 
a  general  approach  is  termed  as  the  sample  average  approximation  (SAA)  method,  and  theories  on  the 
asymptotic  convergence  to  the  original  problem’s  optimal  solution  with  increasing  the  sample  sizes  have 
been  well  established.  However,  our  problem  is  a  game  with  a  fixed  sample  of  the  input  data.  While  it  is 
a  natural  idea  that  an  equilibrium  of  the  original  game  under  uncertainty  can  be  reached  through  finding 
an  equilibrium  of  the  sampled  game  (which  is  a  deterministic  game)  and  by  increasing  the  sample  size, 
the  proof  however  would  really  on  the  epiconvergence  theory,  instead  of  the  typical  techniques  used  in  the 
SAA  method’s  convergence.  While  we  provided  detailed  proof  of  such  convergence  in  [P3],  a  side  benefit 
of  the  proof  is  that  it  weakens  the  conditions  required  when  applying  the  general  SAA  approach  to  the 
block-structured  stochastic  programming  problem  [IT]  As  the  conditions  known  in  the  literature  require  the 


objective  functions  in  17  to  be  uniformly  convergent  with  increasing  sample  sizes,  we  showed  that  only 


epiconvergence  of  the  objective  function  is  needed. 

The  two  modified  BR-type  algorithms  are  to  be  provided  below,  followed  by  their  convergence  results. 
Notation-wise,  we  use  <ff(x)  to  denote  the  sample  averaged  objective  function  of  an  agent  /  with  a  fixed 
sample  size  N;  that  is,  with  a  fixed  sample  of  size  N,  each  agent  /  =  1 .....  F  solves  the  following 
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optimization  problem: 


minimize  fif{x)  =  Y^[df{xf,  x~f,  &)] 
f  3= 1 

subject  to  xf  G  X/. 

We  refer  to  the  (deterministic)  game  with  each  agent  solving  the  above  problem  as  SAANEP  (fif,  Xf)J=1. 


Algorithm  1  Regularized  Gauss-Seidel  (RGS)  algorithm 
Step  0:  Initialize  -  Set  x°  (x°)  J=1,  A;  ^ —  0. 

Step  1 :  Termination  Check:  IF  xk  satisfies  termination  criteria,  THEN  STOP 
Step  2:  Main  Iteration: 

FOR  /  =  1, . . . ,  F,  let  xk+1  solve 


minimize 

xf 


fc+i 


subject  to 

Step  3:  Update:  xk+1  =  (xk+1)J=1. 


-r,  <rk  4-  T- 

5  /_1  ’  X/+l  5  *  *  *  ?  F  )  ' 

xf  G  X/. 


Xf  —  Xf 


(18) 


Theorem  1.  (Convergence  of  Algorithm]!!  [P3])  Suppose  NEP(6  f  (■,£),  X  /)|=1  A  on  era  <7  potential  game 
and  is  player-wise  convex  for  each  f  G  ST  /«  addition,  each  Off,  X-f,  f)  is  continuously  differentiable  for 
any  £  X_#  and  for  any  (  £  E.  77?e/z  the  following  statements  hold  true: 


1.  Every  limit  point  x  of  the  sequence  xk  generated  by  Algorithm's  an  equilibrium  to  SAANEP  (<ff,  X  f)J=\ 


2.  Suppose  further  that  the  family  of  approximate  functions  { 0/  }  ^  |  multi-epiconverges  to  { A>/ } /—  i 

on  the  set  X.  In  this  case,  if  x  x*  as  N  — >•  oc,  then  x*  is  an  equilibrium  of  the  original  game 
\F 


NEP(0f,Xf)f=1. 


Algorithm  2  Regularized  Gauss-Jacobi  (RGJ)  Algorithm  for  SNEP (9f,Xf)J=1 

Step  0:  Initialize  -  Set  X  >  0,  r  >  0,  x°  <—  (Xf)J=1  G  X,  k  «—  0. 

Step  1 :  Termination  Check:  IF  xk  satisfies  termination  criteria,  THEN  STOP 
Step  2:  Main  Iteration: 

FOR  /  =  1, . . . ,  F,  let  xk+1  solve 

minimize  (j>f(xf,xlff)+T 

Xf 

subject  to  xf  G  Xf. 

Step  3:  Update:  xk+x  =  ( xk+1)f=1 . 


Xf  —  Xf 


f  191 


Theorem  2.  (Convergence  of  Algorithm  jJj  [P3])  Suppose  the  NEP(Q  ff ,  £) ,  X f)J=i  satisfies  all  the  condi¬ 
tions  in  Theorem  |7j  In  addition,  for  each  f  =  1, ...  ,F,  X  (■ ,  f)  is  Lipschitz  continuous  for  each  f  G  Q, 
and  t  is  chosen  such  that 


2  cT 


min  inf  cTf  (x)  >  Lxjp, 

f&F  x&X  * 


(20) 


where  cTf  is  the  coefficient  of  strong  convexity  ofP(xf,  x>ff)+T 
hold  true: 


xf 


2 

.  Then  the  following  statements 


14 

DISTRIBUTION  A:  Distribution  approved  for  public  release 


1. 

2. 


Every  limit  point  x  of  the  sequence  xk  generated  by  A  l go  ri  thin  is  an  equilibrium  to  SAA  NEP(dj.Xj)  j_  { 


Suppose  further  that  the  family  of  approximate  functions  multi-epiconverges  to  {(j)f}^=1  on 

the  set  X.  In  this  case,  if  x  — >  x*  as  N  — >  oo,  then  x*  solves  SNEP(0f ,  Xf)J=1. 


Numerical  Implementation.  We  applied  both  the  RGS  and  RGJ  algorithms  to  various  instances  of  two 
classes  of  exact  potential  games:  Nash-Cournot  games  and  self-routing  games.  The  detailed  accounts  of 
the  game  setup  and  input  data  are  provided  in  [P3]  and  [Dl].  Several  important  observations  from  the 
numerical  experiments  are  as  follows:  (1)  multiple  equilibria  indeed  were  found  when  applying  the  RGS 
algorithm  to  self  routing  games;  (2)  the  performance  improvement  (in  terms  of  convergence  time)  with 
parallel  implementation  of  the  RGJ  algorithm  is  close  to  be  linear  scaling;  that  is,  let  M  denote  the  number 
of  CPUs  used  to  carry  out  each  iteration  in  Algorithm  [2j  then  the  overall  convergence  time  of  the  RGJ 
algorithm  is  approximately  T /M,  where  T  is  the  run  time  when  M  =  1;  (3)  both  the  number  of  iterations 
and  the  overall  convergence  time  are  sensitive  to  the  value  of  the  regularization  parameter  r,  with  the  bigger 
the  r  is,  the  worse  the  overall  performance  is. 
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4.2  Journal  Publications 


[PI]  H.  Sreekumaran,  A.  R.  Hota,  A.  L.  Liu,  N.  A.  Uhan  and  S.  Sundaram.  Multi-Agent  Decentralized 
Network  Interdiction  Games.  To  be  submitted  to  European  Journal  of  Operations  Research ;  preprint 
available  at  http  :  / /arxiv.  org/abs/1503 .01100 

[P2]  N.  A.  Uhan.  Stochastic  linear  programming  games  with  concave  preferences.  European  Journal 
of  Operational  Research,  243(2)  637-646,  2015. 

[P3]  H.  Sreekumaran  and  A.  L.  Liu.  Bridging  Game  Theory  and  Large-Scale  Optimization:  The 
Case  of  Potential  Games  under  Uncertainty  and  Expected- Value  Optimization.  In  preparation.  To  be 
submitted  to  SIAM  Journal  on  Optimization. 

[P4]  H.  Sreekumaran  and  A.  L.  Liu.  Equilibrium  Wind  Hedge  Contract  Structures  through  Nash 
Bargaining.  To  be  submitted  to  Operations  Research. 

[P5]  R.  Chen,  H.  Sreekumaran,  and  A.  L.  Liu.  Decentralized  Algorithms  to  Compute  Nash  Equilibria 
under  Uncertainty:  The  Case  of  Nash-Cournot  Games.  In  preparation.  To  be  submitted  to  Operations 
Research. 

4.3  Conference  Presentations  and  Seminars 

[CS1]  A.  L.  Liu.  On  potential  games  and  generalized  Nash  equilibrium  problems.  Workshop  on 
Complementarity  and  its  Extensions.  Singapore.  December  2012. 

[CS2]  H.  Sreekumaran.  Multi-agent  network  interdiction  games.  INFORMS  Annual  Meeting.  Min¬ 
neapolis,  MN.  October  2013. 

[CS3]  H.  Sreekumaran.  Potential  games  with  exogenous  uncertainty.  INFORMS  Annual  Meeting. 
San  Francisco,  CA.  November  2014. 

[CS4]  A.  L.  Liu.  Convergent  decentralized  algorithms  for  certain  Nash  equilibrium  problems.  Sem¬ 
inar,  Daniel  J.  Epstein  Department  of  Industrial  and  Systems  Engineering,  University  of  Southern 
California,  September  15,  2015. 

[CS5]  H.  Sreekumaran.  Distributed  algorithms  for  games  under  exogenous  uncertainty.  22nd  Inter¬ 
national  Symposium  on  Mathematical  Programming,  Pittsburgh,  PA.  July  2015. 

[CS6]  A.  L.  Liu.  Decentralized  algorithms  for  block-structured  stochastic  programs  and  potential 
games  under  uncertainty.  INFORMS  Annual  Meeting,  Philadelphia,  PA.  November  2015. 

[CS7]  A.  L.  Liu.  Convergent  decentralized  algorithms  for  certain  Nash  equilibrium  problems.  Semi¬ 
nar,  Department  of  Industrial  and  Systems  Engineering,  Lehigh  University,  December  2,  2015. 
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